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Consider a continuous random pair {X, Y) whose dependence is 
characterized by an extreme-value copula with Pickands dependence 
function A. When the marginal distributions of X and Y are known, 
several consistent estimators of A are available. Most of them are vari- 
ants of the estimators due to Pickands [Bull. Inst. Internal. Statist. 
49 (1981) 859-878] and Caperaa, Fougeres and Genest [Biometrika 
84 (1997) 567-577]. In this paper, rank-based versions of these esti- 
mators are proposed for the more common case where the margins 
of X and Y are unknown. Results on the limit behavior of a class 
of weighted bivariate empirical processes are used to show the con- 
sistency and asymptotic normality of these rank-based estimators. 
Their finite- and large-sample performance is then compared to that 
of their known-margin analogues, as well as with endpoint-corrected 
versions thereof. Explicit formulas and consistent estimates for their 
asymptotic variances are also given. 

1. Introduction. Let (X, Y) be a pair of continuous random variables 
with joint and marginal distribution functions defined for all x, y € M by 



respectively. Let also U = F{X) and V = G{Y), and for all u, v G M, write 
C(n, v) = V{U <u,V < v). As shown by Sklar (1959), C is the unique copula 
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H{x,y)=¥{X<x,Y<y) 
G(y)=P(y<y), 



F{x)=V{X <x) 
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for which H admits the representation 

(1.2) H{x,y) = C{Fix),G{y)} 

for ah x,y € M. The dependence between X and Y is characterized by C. 

This paper is concerned with the estimation of C under the assumption 
that it is an extreme- value copula, that is, when there exists a function 
^: [0,1] ^ [1/2,1] such that for all {u,v) G (0,1)^, 

(1.3) C{u,v) = (n7;)^^'°s(^)/l°g(™)>. 

It was shown by Pickands (1981) that C is a copula if and only if A is convex 
and max(t, 1 —t) < A(t) < 1 for all t G [0, 1] . By reference to this work, the 
function A is often referred to as the "Pickands dependence function." 

The interest in extreme-value copulas stems from their characterization 
as the large-sample limits of copulas of componentwise maxima of strongly 
mixing stationary sequences [Deheuvels (1984) and Hsing (1989)]. More gen- 
erally, these copulas provide flexible models for dependence between posi- 
tively associated variables [Cebrian, Denuit and Lambert (2003), Ghoudi, 
Khoudraji and Rivest (1998) and Tawn (1988)]. 

Parametric and nonparametric estimation methods for A are reviewed in 
Section 9.3 of Beirlant et al. (2004). Nonparametric estimation based on a 
random sample (Xi, Yi), . . . , (X„, 1^) from H has been considered succes- 
sively by Pickands (1981), Deheuvels (1991), Caperaa, Fougeres and Genest 
(1997), Jimenez, Villa-Diharce and Flores (2001), Hall and Tajvidi (2000) 
and Segers (2007). In these papers, the margins F and G are assumed to be 
known, so that in effect, a random sample (F(Xi), G(Yi)), . . . , {F{Xn),G{Yn)) 
from C is available. In their multivariate extension of the Caperaa-Fougeres- 
Genest (CFG) estimator, Zhang, Wells and Peng (2008) also assume knowl- 
edge of the marginal distributions. 

In practice, however, margins are rarely known. A natural way to proceed 
is then to estimate F and G by their empirical counterparts Fn and G„ , and 
to base the estimation of C on the pseudo-observations {Fn{Xi) , GniXi)) , ■ ■ ■ , 
{Fn{Xn) , GniYn)) ■ This amounts to working with the pairs of scaled ranks. 
This avenue was recently considered by Abdous and Ghoudi (2005), but 
only from a computational point of view. No theory was provided. 

Rank-based versions of the Pickands and CFG estimators of A are de- 
fined in Section 2. Endpoint corrections in the manner of Deheuvels and 
Hall-Tajvidi are also considered, but in contrast with the case of known 
margins, they have no effect asymptotically. Weak consistency and asymp- 
totic normality of the rank-based estimators are established in Section 3. 
Explicit formulas and consistent estimates for their asymptotic variances 
are also given. Large-sample comparisons reported in Section 4 show that 
the rank-based estimators are more efficient than the uncorrected versions 
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based on the true, known margins. Extensive numerical work also suggests 
that the CFG estimator is generally preferable to the Pickands estimator. 
A small simulation study reported in Section 5 provides evidence that the 
conclusions remain valid in small samples, and a few closing comments are 
made in Section 6. 

In order to ease reading, all technical arguments are relegated to Appen- 
dices A-F. The developments rely heavily on a limit theorem for weighted 
bivariate empirical processes, which may be of independent interest. A state- 
ment and proof of the latter result are given in Appendix G. 

The following notational conventions are used in the sequel. For x,y G 
M, let X Ay = mm(x,y) and xV y = max(x,y). The arrow means weak 
convergence, and l(-E') stands for the indicator function of the event E. 
Given a univariate cumulative distribution function F, its left-continuous 
generalized inverse is denoted by . Furthermore, C([0, 1]) represents the 
space of continuous, real- valued functions on [0, 1], while £°°(>V) is the space 
of bounded, real-valued functions on the set W; both are equipped with the 
topology of uniform convergence. 

2. Estimators of the dependence function. Consider a pair {X,Y) of 
continuous random variables whose joint distribution function H has mar- 
gins F and G, as per (1.1). Assume that the unique function C implicitly 
defined by (1.2) belongs to the class (1.3) of extreme-value copulas. 

Let U = F{X) and V = G{Y). The pair {U,V) is then distributed as C. 
Accordingly, the variables S = — log U and T = — log V are exponential with 
unit mean. For all t E (0, 1), write 

^(t) = A - 

' 1-t t 

and set ^(0) = S, ^(1) = T. Note that for any t £ [0, 1] and x>0, one has 

P{e(i) >x} = F{S>{l- t)x, T > tx} 

= F{U < e-(^-*)^ F < e"*^} = e-^^W. 

Thus, ^(t) is an exponential random variable with 

(2.1) E{e(i)} = lM(t) and E{loge(t)} = - log A(t) - 7, 

where 7 = — Jq^ log(x)e~^ dx ~ 0.577 is Euler's constant. These observations 
motivate the following estimators of A. 

2.1. Pickands and CFG estimators. Let (Xi, Yi), . . . , (X„, 1^) be a ran- 
dom sample from H. For i £ {1,. . . ,n}, let Ui = F{Xi), Vi = G(Yi), Si = 
-log{Ui)=^,i{0) and Ti = - log{Vi) = , and for all tG (0,1), write 

1-t t 
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When the margins F and G are known, the estimation of A{t) for arbitrary 
t G (0, 1) can be based on the sample ^i(t), . . . i^nit)- In view of (2.1), two 
obvious solutions are defined implicitly by 

1 " 

v<u(i) = -Ee.w, 

i=l 

ii . ^ 

1=1 

The first estimator was proposed by Pickands (1981). As shown by Segers 
(2007), the second is an uncorrected form of the estimator due to Caperaa, 
Fougeres and Genest (1997). The index "u" is used to distinguish these 
estimators from endpoint-corrected versions introduced in Section 2.3. 

2.2. Rank-based versions of the Pickands and CFG estimators. When 
the margins F and G are unknown, a natural solution is to rely on their 
empirical counterparts, F„ and Gn- This leads to pairs {Fn{Xi),Gn{Yi)), . . . , 
{Fn{Xn) , Gn{Yn)) of pseudo-observations for G. To avoid dealing with points 
at the boundary of the unit square, however, it is more convenient to work 
with scaled variables Ui = nFn{Xi)/{n + l) and Vi = nGniYi)/{n + l) defined 
explicitly for every i G {1, . . . , n} by 

The pair (i7j, V^), whose coordinates are scaled ranks, can be regarded as 
a sample analogue of the unobservable pair {Ui,Vi) = {F{Xi),G{Yi)). For 
every i e {1, . . . , n} and for arbitrary t € (0, 1), let 

5i = -log{7, = e,(0), fi = -\ogV^ = ii{l), |,(t) = -^A^. 

Rank-based versions of A^y(i) and A^y*^(t) are then given by 

1 " 

(2.2) l/Al,{t) = -Y^m, 

^ i=l 

1 " 

(2.3) log^CFG(^)^_^__^l^g^.(^)_ 

1=1 

It is these estimators that are the focus of the present study. Before pro- 
ceeding, however, endpoint corrections will be discussed briefly. 
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2.3. Endpoint corrections to the Pickands estimator. The Pickands esti- 
mator does not satisfy the endpoint constraints 

(2.4) ^(0)=^(1) = 1. 

In order to overcome this defect, Deheuvels (1991) introduced the estimator 

i/<,(t) = iML(t) - (1 - t){i/<„(o) - 1} - t{i/<,(i) - 1}. 

More generally, Segers (2007) considered endpoint corrections of the form 

l/<,b = l/<u(t) - a(t){l/<u(0) - 1} - 6(t){l/<„(l) - 1}, 

where a, b:[0,l] — > M are arbitrary continuous mappings. He showed how to 
select (and estimate) a and b so as to minimize the asymptotic variance of 
the endpoint-corrected estimator at every t € [0, 1]. In particular, he showed 
that a{t) = 1 — t, b{t) = t are optimal at independence. For arbitrary A, the 
Pickands estimator with optimal correction is denoted hereafter. 

Similar strategies can be used to ensure that the rank-based estimator 
A^^ fulfills conditions (2.4). Note, however, that whatever the choice of 
mappings a and 6, the endpoint-corrected estimator is then asymptotically 
equivalent to A^^^, because 

1 " 

(2.5) l/AlM = 1ML(1) = - E log{(^ + 1) A} = 1 + 0{n-' logn). 

1=1 

An alternative correction inspired by Hall and Tajvidi (2000) is given by 

1 " 

i/A^j{t) = -Y^m. 

1=1 

where for all t € [0, 1] and z G {1, . . . , n}, 

with Si = nSi/{Si H h Sn) and Tj = nTi/{Ti H h T„). 

By construction, one has A^J(O) = A^J(l) = 1, but an additional merit 
of this estimator is that A^^[t) > t V (1 — t) for all t G [0, 1]. Note, however, 
that because A^"^ = A^ j,/A^ j.(0), this estimator is again asymptotically in- 
distinguishable from Al^^ in view of (2.5). 

2.4. Endpoint corrections to the CFG estimator. In order to meet con- 
straints (2.4), Caperaa, Fougeres and Genest (1997) consider estimators of 
the form 

logA^FG(^) ^l^g^CFG(^) _^(^)l^g^CFG(o) _ {I . pi^t)} log A^^ {I), 
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where p : [0, 1] ^ M is an arbitrary continuous mapping. They use p{t) = l — t 
as an expedient when the margins are known. The more general choice, 

log AZ^{t)= log AZ''{t)-a{t) log A^l^{0)-b{t) log AZ^'il) 

is investigated by Segers (2007), who identified the optimal functions a and 
b. The resulting estimator is hereafter denoted ^^^opt- 

When the margins are unknown, however, the correction to the estimator 
A^^^ (and hence the choices of p, a and b) has no impact on the asymptotic 
distribution of this rank-based statistic, because 

-logA^^<'{0) = -logAZ^{l) 



1 " 

— y^loglog{(n + l)/i} — / loglog(l/x) 
n^tt Jo 

0{n-^{lognf}. 



3. Asymptotic results. The limiting behavior of the estimators A^ j. and 
A^^^ (and of their asymptotically equivalent variants) can be determined 
once they have been expressed as appropriate functionals of the empirical 
copula Cn, defined for all u,v G [0, 1] by 

1 " . 

(3.1) Cn{u,v) = -y2l{U^<u,V,<v). 

Note that this definition is somewhat different from the original one given 
by Deheuvels (1979). From inequality (B.l) in Appendix B, one can see that 
the difference between the two versions is 0{n~^) as n—> oo almost surely. 
The following lemma is proved in Appendix A. 

Lemma 3.1. For every t € [0, 1], one has 

(3.2) l/<r(i)= f'Cn{u'-\u')-, 

Jo u 



(3.3) logACFG(i) = _^+ [\cn{u^-\u')-l{u>e-^)} 

Jo 



du 



ulogu 



Note that replacing Cn{u^~\u^) by C(t(^-*, it*) = in (3.2) and (3.3) 
yields 1/A{t) and log^(t), respectively. Therefore, both A^^ and A^\'^ can 
be expected to yield consistent and asymptotically unbiased estimators of 
A. More generally, the asymptotic behavior of the processes 

Al, = n'^\Al,-A) and = nV2(^CFG _ ^) 

is a function of the limit, C, of the empirical copula process 
(3.4) Cn = n^'^{Cn-C). 
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3.1. Limiting behavior of A^^ and ^4^^*^. As shown under various con- 
ditions by Riischendorf (1976), Stute (1984), Fermanian, Radulovic and 
Wegkamp (2004) and Tsukahara (2005), the weak Umit C of the process 
Cn is closely related to a bivariate pinned C-Brownian sheet a, that is, a 
centered Gaussian random field on [0, 1]^ whose covariance function is de- 
fined for every value of u, v, u' , v' € [0, 1] by 

cov{a{u, v),a{u' ,v')} = C{u Au' ,v A v') — C{u, v)C{u', v'). 

Denoting Ci{u,v) = dC{u,v) / du and C2{u,v) = dC{u,v)/dv, one has 

C(n, v) = a{u, v) — Ci{u, v)a{u, 1) — C2(n, v)a{l, v) 

for every pair {u,v) E [0,1]^. The weak limits of the rank-based processes 
A^^ and A^^'^ are then, respectively, defined at each t G [0, 1] by 

(3.5) Af{t) = -A'^{t)[\{u^~\u')—, 

Jo u 

(3.6) A,CFG(i) = ^(i) f'c{u'-',u')^^. 

Jo u log u 

This fact, which is the main result of the present paper, is stated formally 
below under the assumption that A is twice continuously differentiable. This 
hypothesis could possibly be relaxed, but at the cost of an extension of the 
strong approximation results in Stute (1984) and Tsukahara (2005). 

Theorem 3.2. If A is twice continuously differentiable, then Af^^ ~^ 
and A^^*^ — ^ Ap^^ as n—f oo in the space C([0, 1]) equipped with the topology 
of uniform convergence. 

This result, which is proved in Appendix B, is to be contrasted with 
the case of known margins, where one has access to the pairs {Ui,Vi) = 
{F{Xi),G{Yi)) for ah z G {1, . . . , n}. As shown by Segers (2007), the estima- 
tors and A^^ are then of the same form as in (3.2) and (3.3), but 
with Cn replaced by the empirical distribution function 

1 " 

Cn{u,v) = -y2l{U^<u,V^<v). 

1=1 

The asymptotic behavior of the estimators is then as in Theorem 3.2, but 
with the process C in (3.5) and (3.6) replaced by the C-Brownian sheet 
a. In what follows, the weak limits of the processes n^/^(A^^ — A) and 
n^/^(j4^^*^ — A) are denoted A^ and A^^*^, respectively. 
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3.2. Asymptotic variances of A^^. and Fix u,v,t ^ [0, 1] and let 

a{u, v; t) = cov{C(u^-*, u*), C(w^-*, v^)}. 
In view of Theorem 3.2, the asymptotic variances of the estimators A^ ,.{t) 
and of A{t) are given by 

du dv 



vai (t) = A^(t) [ [ a(u,v;t) — — 

Jo Jo u V 

varAp^^(t) = ^2(t) / / a{u,v;t 

Jo Jo 



du dv 



ulogu flog?; 

Closed-form expressions for the latter are given next in terms of 

fi{t) = A{t) - tA{t), u{t) = A{t) + (1 - t)A{t), 

where A(t) = dA{t)/dt E [—1, 1] for all t £ (0, 1). The asymptotic variance of 
A^^'^ also involves the dilogarithm function L2, defined for x € [—1, 1] by 

dz 



L2{x) 



log(l - z) ■ 



E 

k=l 



The proof of the following result is presented in Appendix C. 

Proposition 3.3. For t g [0, 1], let Ai{t) = A{t)/t, A2{t) = A{t)/{1 - 
t), j2{t) = 1 — fi{t) and ^{t) = 1 — iy{t). Then, var A^(t) is given by 

. r ... ,.2 2^(0/2(^)^2 (t) 2l^itMt)A^{t) 

2-Mt) + u{t)-l} -^J^^, 2A,(t)-l 

+ 2n{t)u{t)Ai{t)A2{t) f\A{s) + sAi{t) + (1 - s)A2{t) - 1}-^ ds 
Jo 

- 2fx{t)A^{t)A2{t) f[A{s) + (1 - s){A2{t) - ds 
Jo 

-2u(t)Ai{t)A2{t) j\A{s) + s{Ai{t) - l]]-^ds, 

while 74~2(t) var Ap^^(t) equals 

{l+^\t) + y\t)-i,{t)-y{t)}L2{l) 

- 2/i(t)/Z(t)L2{-l + l/A2{t)} - 2v{t)v{t)L2{-l + iMi(t)} 

t{l-t) l-^(s) 



2ti{t)u{t) / log 1 



+ 2ti{t) / log 



ds 



A{t) t{l- s) + {l-t)s] s{l-s 



ds 



+ 2v{t) / log 



1 



t{l - t) I - A{s) + s{Ai{t) - 1} 
A{t) t{l-s) + {l-t)s 

t{l -t)l- A{s) + (1 - s){A2{t) - 1} 



.(1 



A{t) 



t{l -s) + {l-t)s 



ds 



sil 
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As stated below, great simplifications occur at independence. The proof 
of this result is given in Appendix D. 



Corollary 3.4. IfA=l, then ^ = v = 1, and for all t ^[Q, I], 

, 2,t{l-t) 
var A, (t) - ^ ' 



{2-t){l + ty 
var Ap^'^(t) = 2L2(-1) - 2L2(t - 1) - 2L2(-t). 



3.3. Consistent estimates of the asymptotic variances. Proposition 3.3 
can be used to construct consistent estimates of var Aj'(t) and var A^^^(i) 
for arbitrary t G [0, 1]. The latter is useful, for example, for the construction 
of asymptotic confidence intervals for the Pickands dependence function. 

Specifically, suppose that {An) is any sequence of consistent estimators 
for A, that is, suppose that — j4|| ^ in probability as n — > oo, where 
II ■ II is the supremum norm on C([0, 1]). Put 

An = greatest convex minorant of [An A 1) V / V (1 — /), 

where I denotes the identity function. One can then invoke a lemma of Mar- 
shall (1970) to deduce that \\An — A\\ < \\An — A\\ and, hence, that (An) is 
also a consistent sequence of estimators. Furthermore, An is itself a Pickands 
dependence function. See Fils-Villetard, Guillou and Segers (2008) for an- 
other way of converting a pilot estimate An into a Pickands dependence 
function. 

Let A'n denote the right-hand side derivative of An ■ Because An is convex 
for every n G N, it is not hard to see that if A is continuously differentiable 
in t E (0,1), then An{t) — A{t) in probability as n — > oo. Consequently, if 
A is replaced by An in Proposition 3.3, it can be seen that the resulting 
expressions converge in probability. In other words, 

n var A^^^ (t) var Af (t) and n var ^^J^ (t) var A^^^ {t) . 

4. Efficiency comparisons. Which of the rank-based estimators A^ ^ and 
An^'^ is preferable in practice? When the margins are known, how do they 
fare compared with their uncorrected, corrected and optimal competitors? 
These issues are considered next in terms of asymptotic efficiency. 

Figure 1 summarizes the findings, based either on mathematical deriva- 
tions or on numerical calculations. In the diagram, an arrow Ei E2 be- 
tween estimators E\ and E2 means that the latter is asymptotically more 
efficiency than the former, that is, cr\ (t) < a'^ (t) for all t € [0, 1]. 



10 



C. GENEST AND J. SEGERS 



P-opt < — P-c < — P-u — > P-r 

i* i* T 

CFG-opt < — CFG-c i — CFG-u — > CFG~r 

Fig. 1. Comparisons of estimators. An arrow E\ — » E2 between estimators Ei and E2 
means that the latter is asymptotically more efficiency than the former, that is, < a%_^ . 
Notation: P = Pickands; CFG — Caperaa-Fougeres-Genest; — u — known margins, uncor- 
rected; — c — known margins, endpoint- corrected; —opt — known margins, optimal correc- 
tion; — r = rank-based. No arrow can be drawn between P-r and CFG-r, between P-c and 
P-r, or between CFG-c and CFG-r. An asterisk (*) marks a conjecture based on extensive 
numerical computations. 



4.1. Uncorrected versus corrected estimators. For all u,v,t(z [0,1], let 

(4.1) ao{u, V- 1) = coY{a{u^~\v'),a{v^~\v')} = {u ^ z;)^^ - {uv)^^'\ 

It follows from the work of Segers (2007) that the asymptotic variances of 
the raw estimators A^^^{t) and A'^^^{t) are given by 



varAP(t) = A^(t) 



/ / ao{u,v;t) = A\t), 

Jo Jo u V 



varA^^^ {t)=A\t) [' f\,(u,v;t)^ = L.m^t), 

Jo Jo nlogn vlogv 

respectively. As ^2(1) ="^^76, Pickands' original estimator is more ef- 
ficient than the uncorrected CFG estimator, A^^, that is, CFG-u —>■ P-u. 

Formulas for the asymptotic variances of the endpoint-corrected versions 
A^^ and A^^'^ are more complex. They can be derived from the fact, also 
established by Segers (2007), that for all choices of continuous mappings 
a, b : [0, 1] M, the weak limit A^j^ of the process n^^'^{A^ — A) satisfies 

(4.2) APjt) =AP(t) -a(t)AP(0) -6(t)AP(l) 

for all t e [0, 1]. A similar result holds for the limit of ?T-^''^(^n,opt ~ 

Using these facts, one can show that asymptotic variance reduction results 
from the application of the endpoint correction a{t) = 1 — t, h{t) = t, both 
for the Pickands and CFG estimators. In other words, one has P-u P-c 
and CFG-u — > CFG-c. This fact is formally stated below. 

Proposition 4.1. For all choices of A and t e [0, 1], one has 
var A^{t)< var A^ (t) and var A^^*-^ (t) < var A^^^^ (t) . 

The proof of this result may be found in Appendix E. Of course, it is 
trivial that P-c ^ P-opt and CFG-c CFG-opt. 
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4.2. Rank-based versus uncorrected estimators. Somewhat more surpris- 
ing, perhaps, is the fact that the rank-based versions of the Pickands and 
CFG estimators are asymptotically more efficient than their uncorrected, 
known-margin counterparts, that is, P-u P-r and CFG~u — > CFG-r. 
This observation is a consequence of the following more general result, whose 
proof may be found in Appendix F. 



Proposition 4.2. For all choices of A and u,v,te [0, 1], one has 
cov{C{u^~\u^),Civ^-\v^)} < cov{a{u^-\u*),a{v^-\v^)}. 



Consequently, the use of ranks improves the asymptotic efficiency of any 
estimator of A whose limiting variance depends on ctq as defined in (4.1) 
through an expression of the form 

/ / ao{u,v;t)f{u,v)dudv, 
Jo Jo 

where / is nonnegative. See Henmi (2004) for other cases where efficiency 
is improved through the estimation of nuisance parameters whose value is 
known. Typically, this phenomenon occurs when the initial estimator is not 
semiparametrically efficient. Apparently, such is the case here, both for A^^ 
and A^^ . As will be seen below, however, the phenomenon does not persist 
when endpoint-corrected estimators are considered. 



4.3. Ranked-based versus optimally corrected estimators. Although effi- 
ciency comparisons between ranked-based and uncorrected versions of the 
Pickands and CFG estimators are interesting from a philosophical point 
of view, endpoint-corrected versions are preferable to the uncorrected es- 
timators when margins are known. For this reason, comparisons between 
rank-based and corrected estimators are more relevant. 

To investigate this issue, plots of the asymptotic variances of 

aP aP aP aCFG /.CFG aCFG 

n,c' n,opt' n,r' n,c ' n,opt' n,r 

were drawn for the following extreme- value copula models: 

(a) The independence model, that is, A{t) = 1 for all t € [0, 1]. 

(b) The asymmetric logistic model [Tawn (1988)], namely, 

Ait) = (1 - vi)(i - 1) + (1 - i;2)t + [(Vit)'/' + {V'2(i - t)}'/y 

with parameters 9 G (0, 1], ipi,ip2 G [0, 1]. The special case V'l = V'2 = 1 
corresponds to the (symmetric) model of Gumbel (1960). 
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(c) The asymmetric negative logistic model [Joe (1990)], namely, 

A{t) = 1 - [{M^ - t)r'/' + {iP2t)-'/']-' 

with parameters 6 € (0, oo), V'iiV'2 £ (0, 1]. The special case ■01 = ^2 = 1 
gives the (symmetric) negative logistic of Galambos (1978). 

(d) The asymmetric mixed model [Tawn (1988)], namely, 

A{t) = l-{e + K)t + et^ + Kt-^ 

with parameters 9 and k satisfying ^>0, + 3k>O, 9 + k< 1, 9 + 2k< 
1. The special case k = and 9 G [0,1] yields the (symmetric) mixed 
model [Tiago de Oliveira (1980)]. 

(e) The bilogistic model [Coles and Tawn (1994) and Joe, Smith and Weiss- 
man (1992)], namely, 

A{t)= / max{{l- p)w-'^{l-t),{l-5){l-w)-h}dw 
Jo 

with parameters (/3, 6) G (0, 1)^ U (-oo, 0)^. 

(f) The model of Hiisler and Reiss (1989), namely, 

^(,) = (1 - t)* (a + i log i^) + (a + i log ^) , 

where A G (0, oo) and $ is the standard normal distribution function. 

(g) The t-EV model [Demarta and McNeil (2005)], in which 

A{w) = wt^+iizw) + (1 - w)t^+i{zi_i,), 

z^ = {i + x)'/'[W(i - w)}'/^ -p]il- p')-'/' 

with parameters X > ^^id P £ (— 1)) where i^+i distribution 
function of a Student-t random variable with x + 1 degrees of freedom. 

Figure 2 corresponds to the case of independence. One can see from it that 
when A = l, the rank-based versions of the Pickands and CFG estimators are 
more efficient than the corresponding optimal, endpoint-corrected versions, 
even though the latter use information about the margins. As illustrated in 
Figure 3, however, the rank-based estimators are not always superior. The 
paradoxical phenomenon mentioned in Section 4.2 thus vanishes. Further, 
note the following: 

(a) When A is symmetric, one would expect the asymptotic variance of an 
estimator to reach its maximum at t= 1/2. Such is not always the case, 
however, as illustrated by the t-EV model. 

(b) In the asymmetric negative logistic model, the asymptotic variance of 
the rank-based and optimally endpoint-corrected estimators is close to 
zero for all t G [0,0.3]. This is due to the fact that A{t) f« 1 — t on this 
interval when 9 = 1/10, 1^1 = 1/2 and ^2 = ^ hi this model. 
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Fig. 2. Graph, as a function of t, of the asymptotic variances of the estimators 
^n,c(t) = ^n,opt(t), A^,i{t), A';^%pt{t) and ^^^^(t) in case of independence, 

A = l. 



4.4. Comparison between the Pickands and CFG estimators. In view of 
Figure 3, neither of the two rank-based estimators dominates the other one 
in terms of asymptotic efficiency. In most cases, however, the CFG estimator 
is superior to the Pickands estimator. In the figure, A^^^^ is also seen to be 
systematically more efficient than ^J^^opt- This observation was confirmed 
for a wide range of models and parameter values through extensive numer- 
ical studies. It was also seen to hold for the endpoint-corrected estimators 
A^^ and A^^'^ originally proposed by Deheuvels (1991) and by Caperaa, 
Fougeres and Genest (1997), respectively. It may be conjectured, therefore, 
that 

P-c A CFG~c and P-opt A CFG~opt. 

5. Simulations. A vast Monte Carlo study was used to confirm that the 
conclusions of Section 4 remain valid in finite-sample settings. For brevity, 
the results of a single experiment are reported here for illustration purposes. 

Specifically, 5000 random samples of size n = 100 were generated from 
the i-EV copula with one degree of freedom and various values of p cho- 
sen in such a way that the tail coefficient 2{1 — ^(0.5)} ranges over the set 
{i/W:i = 0, . . . , 10}. For each sample, the Hall-Tajvidi and the CFG esti- 
mators were computed when the margins are known and unknown. For each 




Asymmetric Mixed (thieta = 1 .33, eta = -0.33) Biiogistic (beta = 0.9, delta = 0.1 ) 




0.0 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1.0 



Hiisier-Reiss (iambda = 1) t-EV (chi = 0.2, rho = 0.8) 




0.0 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1.0 



Fig. 3. Graph, as a function oft, of the asymptotic variances of the estimators A^^{t), 
^n,opt(i)) An.i{i), A'n^c'ii), ^n.opt (*) '"^'^ ^rwr^ (i) Jor six cxtrcme-value copula models. 
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t-EV copula — margins unknown 
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2{1-A(0.5)} 
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Fig. 4. MISE(xlOO) of the estimators A^'^ and A^^^ with correction p(t) = l~t, based 



on 5000 samples of size n = 100 from the t-EV copula with x = 1 
p G [—1, 1] chosen in such a way that 2{1 — A{0.5)} G {i/W :i = 0,. . 
gins are either known (left) or unknown (right). 



legree of freedom and 
. , 10}, when the mar- 



estimator, the empirical version of the mean integrated squared error, 



MISE = E 



f\An{t) - A{t)}'' dt 
Jo 



was computed by averaging out over the 5000 samples. 

In Figure 4, the normalized MISE (i.e., multiplied by n) is plotted as 
a function of 2{1 — A{0.5)}. Similar results were obtained for many other 
extreme-value dependence models. In all cases, the CFG estimator was su- 
perior to the Hall-Tajvidi estimator. In addition, the use of ranks led to 
higher accuracy, although the gain diminished as the level of dependence 
increased. In the case of perfect positive dependence, C{u,v) = u A v, the 
MISE of all estimators vanishes; indeed, it can be checked readily that their 
rate of convergence is then Op(n~^/^). 

6. Conclusion. This paper has proposed rank-based versions of several 
nonparametric estimators for the Pickands dependence function of a bivari- 
ate extreme-value copula. The new estimators were shown to be asymptot- 
ically normal and unbiased. Explicit formulas and consistent estimates for 
their asymptotic variances were also provided. 

In contrast with the existing estimators, the new ones can be used regard- 
less of whether the margins are known or unknown. It is interesting to note 
that the rank-based versions are generally more efficient than their uncor- 
rected counterparts, even when the margins are known. In practice, however, 
only the endpoint corrected versions would be used. While the rank-based 
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estimators no longer have a distinct advantage, they clearly remain com- 
petitive. The extensive numerical work presented herein also suggests that 
the CFG estimator is preferable to the Pickands estimator when (possibly 
optimal) endpoint corrections are applied to both of them. 

A solution to this conjecture would clearly be of practical importance. In 
future work, it would also be of interest to assess more fully the impact of 
sample size, level and form of dependence (e.g., symmetry or lack thereof) 
on the precision of the various rank-based estimators proposed herein. Mul- 
tivariate extensions could also be developed along similar lines as in Zhang, 
Wells and Peng (2008). Finally, the problem of constructing a truly semi- 
parametric efficient estimator of a Pickands dependence function remains 
open. 



To express the Pickands estimator in the form (3.2), use Definition (2.2) 
and the substitution u = to get 



APPENDIX A: PROOF OF LEMMA 3.1 




i=l 





In the case of the CFG estimator, the substitution u = e ^ yields 






Now, in light of Definition (2.3), one can write 



-7-logA^J^(t) = -Elog|.(t) 
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Further, note that ^j(t) > — log(n) if and only if —logUi > —(1 — t) log(n) 
and — log Vi > — i log(u). The rest of the argument is along the same lines as 
in the case of the Pickands estimator. 

APPENDIX B: PROOF OF THEOREM 3.2 

Before proceeding with the proof, recall that in Deheuvels (1979) the 
empirical copula is defined for all u,v G [0, 1] by 

in terms of the empirical marginal and joint distribution functions En, Gn 
and Hn of the sample {Xi,Yi), . . . , (Xn,Yn)- Elementary calculations imply 
that, in the absence of ties (which occur with probability zero when the 
margins are continuous), the difference between and the empirical copula 
Cn in (3.1) is asymptotically negligible, namely, 

(B.l) sup \C^{u,v)-Cn{u,v)\<-. 

(w,t))e[o,i]2 

Let Cn be the empirical distribution function of the (unobservable) ran- 
dom sample {Ui,Vi), . . . , (Un, Vn), and set On = 'n}/'^{Cn — C). Further, define 
the (random) remainder term Rn{u,v) implicitly by 

n^/^Cniu,v)-Ciu,v)} 

(B.2) 

= an{u,v) - Ci{u,v)an{u,l) - C2{u,v)an{l,v) +Rn{u,v). 

Now if Cn is replaced by C^ in the left-hand side of (B.2), results in Stute 
[(1984), page 371] and Tsukahara [(2005), middle of page 359] imply that 

(B.3) sup ji2„(n,z;)| =0{n-i/^(logn)^/2^1oglogn)^/^} 

almost surely as oo, provided that the second-order partial derivatives 
of C exist and are continuous. This condition is automatically satisfied if A 
is twice continuously differ entiable, and hence in view of (B.l), the bound 
(B.3) remains vahd for Rn defined via Cn. 

Proof of Theorem 3.2. 

Pickands estimator. Recall the empirical copula process Cn defined in 
(3.4), and for every t € [0, 1], let 



<rW=^'/'{l/<rW-lMW}- 
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It will be shown that , --^ B = -Af j as n — > CO, which implies that 

as a consequence of the functional version of Slutsky's lemma given in van 
der Vaart and Wellner [(1996), Examples 1.4.7, page 32]. 
Put kn = 21og(n + 1) and use identity (3.2) to see that 



Cn{u^-\u') - = r C„(e-(i-*) , e-*) ds. 
Jo u Jo 



One can then write M^^{t) = /i,n(i) + h,n{t), where for each t G [0, 1], 



Note that the contribution of /i,n(i) is asymptotically negligible. For, if 
s > kni then e~''(^~*^ A e~'^* < l/(n + 1), and hence 

C„(e-^(i-*),e-^*) = -ni/2C(e-^(i-*),e-^*) = -n^/^e'^^W. 

Thus, for ah t G [0,1], 

(B.4) \h,nit)\ = n'/' re-^Wd.<nV2 T e^/^ = ^ < 

Jk„ Jk„ n + l nV^ 

Consequently, the asymptotic behavior of B^j. is determined entirely by l2,n- 
In turn, one can use Stute's representation (B.2) to write l2,n = Ji,n + • ■ • + 
Ji^n, where for each t e [0, 1], 

Ji,n(i)=/ an{e-'^'-'\e-'')ds, 



J2,„(i) = - / anie-<'"'\l)Ci{e-'^'-'\e-'^')ds, 











fkn 

J4,n(i)=/ iin(e-^(^-*\e-^*)ds. 



From (B.3), the contribution of Ji^n(t) becomes negligible as n — > oo, 
because with probability one, 

sup |J4,„(t)| =0{n-i/'^(logn)3/2(loglogn)i/^}. 
ie[o,i] 

Accordingly, the identity l2,n{t) = Ji.nif) + J2,n{f) + •h,n{t) + o(l) holds al- 
most surely and uniformly in t € [0, 1] when oo. 
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To complete the proof, it remains to show that the terms Ji^n{t) with 
i = 1, 2, 3 have a suitable joint limit. To this end, fix to € (0, 1/2) and write 
q^{t) = t'^(l - t)'^ for all t G [0, 1]. Let also Gn,ujiu,v) = an{u,v)/q^{u A v) 
for all n, -y S [0,1]. A simple substitution then shows that each Ji^n{t) is a 
continuous functional of the weighted empirical process G„^(^, namely, 

(B.5) Ji,n{t)= GnAe"^ \e-'')Kiis,t)ds, 

Jo 

(B.6) J2,n{t) = - r"G„,,(e-'^(^-*),l)K2(s,t)ds, 

Jo 

(B.7) J3,„(t) = - [''"GnAhe-'')K3is,t)ds 

Jo 

with Ki{s,t) = g^(e-^(i-*) A e-^*), K2{s,t) = g^(e"^(i-*))Ci(e-''(i-*), e"^*) 
and /s:3(s,t) =ga;(e-'*)C'2(e-^(^-*),e-**) for allsG(0,(X)) andte[0,l]. 

The conclusion will then follow from Theorem G.l and the continuous 
mapping theorem, provided that for i = 1,2,3, there exists an integrable 
function K* : (0, oo) ^ M such that Ki{s, t) < K*{s) for aU s and t. For Ki , 
this is immediate because Ki{s) < qu;{e-'/^) < e-'^'/'^. For K2, note that 

Ci(e-^(i-*),e-^*) = e-^^^W-(i-*)>^*(t), 

where /i(t) = A{t) - tA{t). Recalling also that A{t) > t V (1 - t), one finds 

K2{s,t) < ;u(t)e-^{^W"(^~^)(i-*)> < ;tf(t)e-^^/2_ 

As a similar argument works for K^, the proof is complete. 

CFG ESTIMATOR. The argument mimics the proof pertaining to the 
Pickands estimator. To emphasize the parallel, the same notation is used 
and the presentation focuses on the changes. 

In view of Lemma 3.1 and the functional version of Slutsky's lemma, the 
process to be studied is given for all t G [0, 1] by 

= /'c„(ni-*,u*)-^ = - rC.(e-(^-*),e-*)^. 
JO ulogii Jo s 

This process can be decomposed as — (/i,n + l2,n + l3,n), where 
hAt)= rC„(e-(^-*),e-*)-, 

Jkn S 
Jin S 

hAt)= rC.(e-(^-*),e-^*)- 
JO s 
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with kn = 21og(n + 1) as above and ^„ = l/(n + 1). 

Arguing as in (B.4), one sees that |/i,n| ^ Similarly, I^^n is negligible 

asymptotically. For, if s € {0,in) and t G [0,1], one has e-^(i-*) A e'"^ > 
g-i/(n+i) > i/(n + 1), and then (^^(e-^^i-*), e"**) = 1, so that 

|C„(e-^(i-*),e-^*)| < 71^/^1 - e-^^W} < n^/^sA{t). 

Therefore, |/3,„| < 7i^^^£n < n'^l'^ . 

As a result, the asymptotic behavior of B^^*^ is determined entirely by 
-^2,n- Using Stute's representation (B.2), one may then write /2,n = J\,n + 
• • • + •^4,n) where for all t G [0, 1], 

Ji,„(t)= r"a„(e-(i-*),e-*)-, 

h.nit) = - t a„,(e-^(^-*) , (e-(i-*) , e^*) - , 

j£„ S 

Jt„ s 
Again, the term J^^n is negligible asymptotically because as oo, 

|-^4,n(i)l < iog(fc„/4) sup \Rn{u,v)\ 

(«,v)e[o,i]2 
= 0{n-i/^(logn)3/2(loglogn)i/^} 

almost surely and uniformly in t G [0,1]- As for Ji^„, J2,„ and Js^m they 
admit the same representations as (B.5)-(B.7), except that in each case, 
the integration is limited to the interval (£„,fen)- For s G [l,oo), the same 
upper bounds K^, ET^, apply, and they have already been shown to be 
integrable on this domain. As for the integrability on (0, 1), it follows from 
the additional bound |1 - e~^(^~*) A e~'**h < s'^. □ 



APPENDIX C: PROOF OF PROPOSITION 3.3 
The proofs rely on the fact that for all u,?; ,t G [0, 1], 

a{u,v;t) =aoiu,v;t) + {uv)'^^^U '^ae{u,v;t) -^ai{u,v;t) \, 

u=i e=5 J 
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where ctq is given by (4.1) and 



ai{u,v;t) = (u*"^ Aw*"^ - l)^^(t), 
cj2(n, v; t) = (u"* A w"* - l)u^{t), 

C75(n, v; t) = {n-^Wt;*-iC(ui-* A v^~\u') - 
cj6(n,7;;t) = {u'-^'^^^'^ C{u^~' hv^-\v') - l]ii{t), 
cj7(n, t>; t) = {n-^W?;-*C(ni-*, u* A - l}u{t), 
cj8(n, t>; t) = {u-'v-^^'^C{v^-\ n* A - l]v{t). 

As ^^_r(0) = ^n,r(l) = 1 and A^J'^(O) = = 1 by construction, it 

may be assumed without loss of generahty that t G (0,1). It is immediate 
from the work of Segers (2007) that 



for the Pickands and the CFG estimators, respectively. Various symmetries 
help to reduce the computation of the remaining terms from eight to three 
for both estimators. In particular, note that ai{u,v;t) = crj+i(v,n;t) for i = 
3, 5, 7 and all t € [0, 1]. Furthermore, 

a2{u,v;t) = ai{u,v;t), a7{u,v;t) = a5{u,v;t), 

where a bar over a function means that all instances of ^, t, fi and u in it 
should be replaced by 1 — A, 1 — t, 1 — /i and 1 — i', respectively. Thus, if 




and 





/p(n,z;) = (H^W-i, 
fcFG{u,v) = (u^;)^W~i 



/{log('u) log(t;)} 



one has f{v,u) = f{u, v) = f{u, v), as weh as 





{(Ti(n, v; t) + ai{u, v; t) + 2(73 (n, t)}f{u, v) dudv 
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and 



8 pI rl 

/ / <Ji{u,v;t)f{u,v)dudv 
i=5-^o Jo 



1 rl 

{2a5{u,v;t) + 2a5{u,v;t)}f{u,v) dudv. 

^0 

Each of the relevant parts is computed in turn. 
Integrals involving ai. For the Pickands estimator, 

1 rl 

{uAvY~\uv)'^'^*'>~Uudv 

JO 

(C.l) =2 [' r u'^^'^-^v'^^'^+'-^dudv 



Jo 



A{t)Jo A{t){2A{t)+t-l} 

Consequently, 

ni n^m c 2 1 



For the CFG estimator, 



JO 



ulog(ti) v\og{v) 

(C.2) 

''^ , f-^ sAif) dv du 



10 Ju vlog{v) u\og{u)' 

Use the substitution v = u^ to rewrite this expression as 

Jo log(n) X 

^ :\ ( {l+x)A{t) + {t-l)x\ dx 

-2 / lOET 



{l + x)A{t) 

-2 f\og[l + {l-l/A2{t)}x] — + 2 f\og{l + x) 
Jo X Jo 

2L2{-l + lM2(i)} + ^2(l). 



dx 

X 



Therefore 

rl rl 



f f ai{u,v;t)fcFG{u,v)dudv = fi\t)[2L2{-l + l/A2{t)} + L2{l)]. 
Jo JO 
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Integrals involving 0-3. First, consider the Pickands estimator. The 
substitutions u^~^ = x and = y yield 

"1 rl 



JO 



1 



1 /•! 



t{l-t)Jo Jo 



xy 



dxdy 



1 



1 rl 



t(l-t)Jo Jo 

Next, use the substitutions x = w^~^ and y = w^. Note that w = xy G 
(0,1], s = log(y)/log(xy) e [0,1], C{x,y) = w"^^^^ and the Jacobian of the 
transformation is —\og{w). The above integral then becomes 



1 rl 



t{l-t) 



Jo 



^Ais)+il-s){A,it)^2}+s{A,it)~2} i^g(^) ^^^^ 



= J(T^) + (1 - - 2} + s{A,{t) - 2} + l]-2d,. 

With A{s) in the integrand, no further simplification is possible. Thus, 



Jo Jo 



a3{u, v; t) fp{u, v) du dv 



[\a{s) + sAi{t) + (1 - s)A2it) - l}-2 ds. 
Jo 



^2(t) t{l-t) 

The same substitutions are used for the CFG estimator. They yield 



'•1 rirc{u^^\v^) 
Jo Jo 



y^l tyt 



du 



dv 



10 JO 



xy J 



ulog(n) vlog{v) 

dx dy 



xlog{x) ylog(y) 



JO 

'log^^^^"'^/(^"*^+'/*^^^*^"^ + ^^' 







log{w) s{l — s) 
ds 



{(l-s)/{l-t) + s/t}A{t) 



8(1 



Hence, 



1 rl 



Jo 



asiu, v; t)f CFG {u, v) du dv 







A{t) t{l - s) + (1 - t)s j s(l - s) ■ 
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Integrals involving 0-5. For the Pickands estimator, one gets 
' /' n-^Wt;*-iC7{(n A vf'\ n*}(n7;)^W — — 

JO U V 

^0 
Jo Jv 

The first integral on the right was computed in (C.l). For the second, collect 
powers in u and v and use the substitutions = x and = y. This yields 

t(r^yo Jo ^^'^'^~^y~'Cix,y)dxdy. 

Then, make the same substitutions x = w^~^, y = w^ that were used for CJ3. 
As the constraint x < y~^+^/^ reduces to s < t, the integral becomes 



1 



^{l~s){A2{t)~2}+A{s)s iQg(^) 



t{l-t) Jo 

^ - s){A2{t) - 2} + ^(s) - s + I]-' ds. 



Thus, 



1 /-i 

cr5(ii, v; t)fp{u, v) dudv 



Jo 



IJi{t) ^i{t) 



A{t){2A{t)+t-l} A^it) 



[A{s) + {l-s){A,{t)-l}r^ds. 



t{l-t) Jo 

Finally, turning to the CFG estimator, one must evaluate 
1 '"" , .A(f) du dv 



Jo ulog(n) v\og{v) 



J?; 



'ulog(n) f log(u) 



The first integral on the right was computed in (C.2). For the second, the 
same substitutions are used as for the Pickands estimator. This yields 



10 Jo xlog(x) 2/ logy 
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Jo JO 



l^^s~l~sAiit)+A{s) _ ^•^^{(l-s)/{l-t)+s/t}A{t)-l 

dw ds 



X 



log(t(;) s(l — s) 







* log - + + s - 1 - + A{s) 



{{l-s)/{l-t) + s/t}A{t) 



ds 



s{l-s) 



Therefore, 

r-l rl 



JO 



a5{u,v;t) /cFG {u,v)dudv 
: ^(t)L2{-l + l/A2{t)} + fi{t)L2il)/2 



1 



t{l - t) 1 - A{s) + s{Ai{t) - 1} 



A{t) t{l - s) + {1 - t)s 
It then suffices to assemble the various terms to conclude. 



ds 



s{l-s) 



APPENDIX D: PROOF OF COROLLARY 3.4 

When C{u,v) = uv for all n, u € [0, 1], one has A{t) = fi{t) = z/(t) = 1 for 
all t € [0, 1] . Upon substitution, one gets 

which simplifies to a{u,v;t) = u{l — v^~^){l — v^) for arbitrary u,v G (0, 1) 
with u <v. Thus, by symmetry, 



vaiAfit) = 2 (1 - - v^) dv : 



3t(l -i) 
{2-m + t) 

for all t S [0, 1]. For the CFG estimator, the substitution v = u^ yields 

"1 '■^ - dv du 



varA,c^G(t) = 2 / f {l-v^'*){l-v^) 

Jo Ju 



vlog{v) log(M) 

log(?x) X 



log{(l-t)a; + l}-log 



{l-t)x + xt + l 



1 + xt 

The latter decomposes into a sum of three integrals, namely. 



dx 

X 



/"^ / X dx f^-\ , ^dx fK , ^dx 
-2 / log(l + x) — + 2 log(l + x) — + 2 log(l + x)—, 





whence the conclusion. 
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APPENDIX E: PROOF OF PROPOSITION 4.1 

Comparison of the Pickands estimators. In view of Theorem 3.1 
of Segers (2007) and relation (4.2) with a{t) = t and b{t) = 1 — t, one has 

varA^(t) - varAu(t) = (1 - t)^ varr?(0) + varr?(l) - 2(1 - t) cov{r/(0), r?(t)} 

- 2t cov{7?(t), r/(l)} + 2t(l - t) cov{7?(0),r7(l)}, 

where rj denotes a zero-mean Gaussian process on [0, 1] whose covariance 
function is defined for all < s < t < 1 by 

,,,,,, s 1 1 - t 1 1 f*^ dw 1 

cov{,(.),^(t)} = --^ + ^-^ + ^^y^ i^-i(^- 

Upon substitution and simplification, var A^(t) — var (t) thus reduces to 
(EA) 



Because A is convex, however, one knows that for all t S [0, 1], 

dw 1 ^ dw l-t 

> —-- and / -77^ > 



A^{w) - Ait) Jt A^{w) - Ait) ■ 

Using these inequalities and the fact that the third summand in (E.l) is 
negative for all t € [0, 1], one gets 

var A^ it) - var Aj;'(t) < -1 + 2[1 - (1 - t)(l - t^) _ t{l - (1 - t)^}] ^ 



-l + 2til-t) 



Ait) 
1 



Ait) 



and this upper bound is negative because Ait) > f V (1 — t) for all t £ [0, 1]. 
Thus, the argument is complete. 

Comparison of the CFG estimators. Theorem 4.2 of Segers (2007) 
and relation (4.2) with a(t) = t and bit) = 1 —t imply that 

var A^^^(t) - var A^''^(t) 

= (1 -t)2varC(0) +t^varC(l) -2(1 -t)cov{C(0),C(i)} 
- 2t cov{C(t), C(l)} + 2t(l - t) cov{C(0), C(l)}, 
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where C denotes a zero- mean Gaussian process on [0, 1] whose covariance 
function is defined for all < s < t < 1 by 



dw 

w 



cov{C(s),C(i)} = - / logH- log(t)log(l-s)- / log(l-u;) 

Jo l-w Jt 

+ log (^^^ log ^(s) + log ( Y^l) A{t) 

2\ ^A{t)j Js w{l-w) 
Upon substitution and simplification, varA^^*^(t) — varAy^'^(t) becomes 

{{l-tf+t^}— + 2{l-t) log{l-w) — + 2t logH- 

6 Jt w Jo 1 — w 

- {log A{t)f + 2{(1 - 1) log(l - t) + 1 log(t)} log A{t) 

, /■* log^fu;) , „ logAfit;) , 
+ 2{l-t) f ^ \ dw + 2t ^ ^ [ dw 
Jo w{l - w) Jt 

log A{w) 



w{l — w) 



2t{l-t) f 
Jo 



■ dw. 



w{l — w) 

Omitting the term — {logyl(t)}^ and using the elementary inequalities 

TT^ , , , , N dw TT^ /"* , / ^ dw 

-{l-t)<- log(l -w)—, ^t<- \og{w) 



/■I ft ( 

(l-t) log{l -w) — +t logH- 
Jt w Jo I 



Jt W Jo 1 — w 

one can see that an upper bound on var is given by 

dw 
w 

+ 2{(1 - t) log(l - t) + t log(t)} log A{t) 

,o /■* logyl(-u;) , n logylfu;) , 
+ 2{l-tf ° ^ ; + 2t^ / ° ^ 

Partial integration and the fact that A{t) > 1 — t for all t € [0, 1] imply that 

pt dy^ ft dw 

/ log(u;)- = -log(t)log(l-t)+ / log(l-u;) — 

Jo I — w Jo w 

< - log(t) log(l -t)<- log(t) log A{t). 

Similarly, the fact that A{t) > t for all t € [0, 1] yields 
/■I 

/ logfl-w) — < -log(l -t)log^(t). 
Jt w 
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Therefore, a more conservative upper bound on var 
is given by 



{(l-t)log(l-t)+tlog(t)}log^(t) 

logyl(ii;) n 2 log74(ii;) 



,9 r logyi -u; , 9 r log 
+ 2{l-tf f ^ \ dw + 2^ / 

Jo w{l - w) Jt w{l - w) 



dw. 



Now A{t) e [1/2,1] and hence 2{A{t) - 1} < logyl(t) < A{t) - 1 for all 
t £ [0, 1]. Consequently, an even more conservative bound is given by 

2{(1 - t) log(l - t) + tlog{t)}{Ait) - 1} 

,9 /■* A(w) - 1 , 9 /■! A(w) - 1 , 
+ 2{l-tf ) ' dw + 2t^ ) ' dw. 
Jo w{l — w) Jt w[l — w) 

Calling on the convexity of A, however, one can show that {A^w) — l}/w < 
{A{t) - l}/t for all w G [0,t] while {A{w) - 1}/(1 -w)< {A{t) - 1}/(1 - t) 
for any w £ [t, 1]. This leads to the final upper bound, namely, 

1 — 

-{t' log(t) - (1 - tf log(l - t)}{A{t) - 1}. 



til-t) 

As the latter is easily checked to be at most zero, the proof is complete. 

APPENDIX F: PROOF OF PROPOSITION 4.2 

In view of the expressions for a{u,v;t) and ao{u,v;t) given in Appendix 
C, it suffices to show that for all u,v,t [0, 1], 

4 8 

£=1 e=5 

If u<v, then 

0'l<cr5, cr2<(T7, crs < CJe, 0-4 < ds 
for all t £ [0, 1], while if < n, then 

0"l<0-6, 0'2<0"8, Cr3<<77, Cr4 < (75 

for all t E [0,1]. Each of these inequalities is an easy consequence of the 
following inequalities, which are valid for every extreme- value copula C, 
associated dependence function A and real numbers u,v,t £ [0, 1]: 

uv<C{u,v), tV{l-t)<A{t)<l, 0<fi{t)<l, 0<i/(t)<l. 

The bounds on and iy{t) stem from the fact that A is convex, ^(0) = 
A{1) = 1 and A{t) G [-1, 1] for every t G (0, 1). 
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APPENDIX G: WEIGHTED BIVARIATE EMPIRICAL PROCESSES 

Let {Ui, Vi), . . . , {Un, Vn) be a random sample from an arbitrary bivariate 
copula C and for all n, u G [0, 1], define 

1 " 

Cn{u,v) = - y^l{U^<u,V^<v). 

n ^ 

1=1 

The purpose of this appendix is to characterize the asymptotic behavior of 
a weighted version of the empirical process a„ = n^/^(C„ — C). Specifically, 
fix (J > 0, and for every x € [0, 1], let q^{t) = - i)"^ and 

, : \ , ifnA^;e 0,1, 
0, if n = or t; = or {u, v) = (1, 1). 

The following result, which may be of independent interest, gives the 
weak limit of the weighted process G„.^ in the space ^°°([0, 1]^) of bounded, 
real-valued functions on [0, 1]^ equipped with the topology of uniform con- 
vergence. Weak convergence is understood in the sense of Hoffman- J0rgensen 
[van der Vaart and Wellner (1996), Section 1.5]. 

Theorem G.l. For every uj e [0,1/2), the process ^n.uj coTivcTges WGCikly 
in -^^([0,1]^) to a centered Gaussian process G^^ with continuous sample 
paths such that Gi^{u, v) = if u = 0, v = or {u,v) = (1, 1), while 

, , C{u Au',vA v') - C(n, v)C{u', v') 
quj[u A v)qi^{u' A v') 

ifuAve (0, 1) and u' Av' e (0, 1) . 

The proof of this result relies on the theory of empirical processes detailed 
in van der Vaart and Wellner (1996), whose notation is adopted. Let 

E = {(n, v) e[0,lf:0<uAv<l} = (0, 1]^ \ {(1, 1)}. 

For fixed (n, v) € E, define the mapping f(^u,v) : E ^ M by 

^ ' q[u A V) 

and consider the class = {f(u,v) ■ {u,v) G E} U {0}, where denotes the 
function vanishing everywhere on E. 

Finally, let P be the probability distribution on E corresponding to C and 
denote by P„ the empirical measure of the sample {Ui,Vi), . . . , (C/„, Ki)- The 
following lemma is instrumental in the proof of Theorem G.l. It pertains to 
the asymptotic behavior of the process G„, where for each / G 

Gnf = n'^\Fnf-Pf) 
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with 

Fn/=-Em,y,), Pf=[[ fiu,v)dC{u,v). 



Lemma G.2. The collection T is a P-Donsker class, that is, there exists 
a P-Brownian bridge G such that G„ ^ G as n ^ oo in i°°{J^). 

Proof. It is enough to check the conditions of Dudley and Koltchinskii 
(1994) reported in Theorem 2.6.14 of van der Vaart and Wellner (1996), 
namely: 

(a) is a VC-major class. 

(b) There exists F : E ^ M such that \f\<F pointwise for all / € and 



/ {P{F > x)}^/"^ dx < OO, 
Jo 



where P{F > x) = P{(s, t) G E : F{s, t)>x}. 
(c) J- is pointwise separable. 

To prove (a), one must check that the class of subsets of E given by 
{{u',v') € K: f{u' ,v') > t} with / ranging over and t over M forms a 
Vapnik-Cervonenkis (VC) class of sets; see page 145 in van der Vaart and 
Wellner (1996). Noting that each f inT can take only two values, one can see 
that this class of subsets coincides with the family of intervals (0,u] x {0,v] 
with {u,v) ranging over E. By Example 2.6.1 on page 135 of van der Vaart 
and Wellner (1996), the latter is indeed a VC-class. 

To prove (b), define F : E ^ M at every (s, i) G E by 

F{s, t) = 2{s-'^ V t"^ V (1 - s)-'^ V (1 - t)-'^}. 
Given that the marginal distributions of C are uniform on [0, 1] , one has 
(G.l) C{u,v)<uAv and 1 - C{u,v) < {I - u) + {1 - v) <2{1 - u Av), 
for all {u,v) G [0, 1]^. Now take {u,v), (s,t) G E. If s < n and t <v, then 

\fiu,v)is,t)\ = < 2(n A < 2{s A t)- = 2(s- V t^), 

while if s > M or t > w, then 

< (1 - s V t)~^ = (1 - s)-^ V (1 - t)-'^. 
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Hence, for every / € ^ and every {s,t) € E, one has \f{s,t)\ < F{s,t). Fur- 
thermore, P{F > x) < 4(x/2)-^/'^ A 1 for every x > 0, so that the condi- 
tion uj < 1/2 ensures the integrabihty condition. Note in passing that since 
Pf = 0, one has Gnf = n^^^Fnf for all / G J^. Hence, as F is an envelope 
for JF, every sample path / i— > G„/ is an element of i°°{J-). 

To prove (c), let Eq be a countable, dense subset of E, and let Q be the 
countable subset of T consisting of the zero function and the functions f(^u,v) 
with {u,v) € Eq. Clearly, every f £ is the pointwise limit of a sequence in 
Q. Furthermore, the envelope function F is P-square integrable, and hence 
pointwise convergence in J- implies L'^{P) convergence. According to the 
definition at the bottom of page 116 of van der Vaart and Wellner (1996), 
this implies that is indeed a pointwise separable class. □ 



The limit process G whose existence is guaranteed by Lemma G.2 is a 
tight, Borel measurable element of £°°{J-) with Gaussian finite-dimensional 
distributions. To establish Theorem G.l, the idea is now to write G^ ^ — 
T{Gn) and G^; = T{G) as images of a continuous mapping T:£°°(JF) 
£°°{[0, 1]^) and to invoke the continuous mapping theorem. 

To this end, introduce </> : [0, 1]^ — > J^, which maps {u, v) € [0, 1]^ to 



[u,v 



f{u,v), ifuAwe(0,l), 

0, if li = or w = or {u, v) = (1, 1). 



Now let T:^°°(J^) ^^°°([0,1]2) be defined by T{z) = zo^ioi all z G ^°°(^). 
One has T(G„) = G„^a; because if {u,v) € [0, 1]^ with u Av £ (0,1), then 

T(G„)(n,^) =G„/(„„) =nV2(p„ _ p)hM^imZ^h^ = G^^Un,v), 
^ q[u A V) 

while if tt = 0, f = or (n, v) = (1, 1) then 

r(G„)(n,f) = G„0 = = Gn,ujiu,v). 

The map T is linear and bounded; it is thus continuous with respect to 
the topologies of uniform convergence on £°°(J^) and 1]^). As for the 

map (p, it is continuous with respect to the Euclidean metric on [0, 1]^ and 
the standard-deviation metric p on JF, defined implicitly by 

p\f, g) = E{(G/ - Ggf} = Pf - 2Pfg + Pg\ 

Indeed, the continuity of cp is easily derived from the bounds (G.l), together 
with the fact that for f[u,v)jf{s,t)^^ ^ -^j 

2 Ciu,v){l-Ciu,v)} 
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and 



P {f{u,v), f{s,t)) 




( 



C{u,v) _ C{s,t) 
q{uAv) q{sAt) 



) 



2 



The scene is finally set for an application of the continuous mapping 
theorem [see Theorem 1.3.6 in van der Vaart and Wellner (1996)]. 

Proof of Theorem G.l. The continuous mapping theorem implies 
that r(G„) r(G) in £°°([0, l]^) as n ^ oo. The limit process = r(G) 
is thus a tight, Borel measurable element of ^°°([0, 1]^) with the desired finite- 
dimensional distributions. Furthermore, (p is continuous, and according to 
the statements on page 41 of van der Vaart and Wellner (1996), the sample 
paths / I— > G/ of G are almost surely uniformly continuous with respect to p. 
Therefore, the sample paths {u,v) i— > G(^(ti, w) = G(j){u,v) are almost surely 
continuous, as well. Hence, the process G^; has all the stated properties. □ 
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